Estimating Self-Sustainability in Peer-to-Peer Swarming Systems 
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Abstract 

Peer-to-peer swarming is one of the de facto solutions for distributed content dissemination in today's Internet. By 
leveraging resources provided by clients, swarming systems reduce the load on and costs to publishers. However, there 
is a limit to how much cost savings can be gained from swarming; for example, for unpopular content peers will always 
depend on the publisher in order to complete their downloads. In this paper, we investigate this dependence. For this 
purpose, we propose a new metric, namely swarm self-sustainability. A swarm is referred to as self-sustaining if all its 
blocks are collectively held by peers; the self-sustainability of a swarm is the fraction of time in which the swarm is 
self-sustaining. We pose the following question: how does the self-sustainability of a swarm vary as a function of content 
popularity, the service capacity of the users, and the size of the file? We present a model to answer the posed question. 
We then propose efficient solution methods to compute self-sustainability. The accuracy of our estimates is validated 
against simulation. Finally, we also provide closed-form expressions for the fraction of time that a given number of 
blocks is collectively held by peers. 

1 Introduction 

Peer-to-peer swarming, such as used by Bit Torrent [B], is a scalable and efficient way to publish content in today's 
Internet. Peer-to-peer swarming has been widely studied during the last decade, and its use by enterprises is steadily 
growing [2,24.31,32]. By leveraging resources provided by clients, peer-to-peer swarming decreases costs to publishers, 
and provides scalability and system robustness. As demand for content increases, system capacity scales accordingly, as 
all clients collaborate with each other while downloading the desired content. As the demand for multimedia files and the 
size of these files increase, peer-to-peer swarming systems have become an important content dissemination solution for 
many content providers [2"ll32|. 

However, there is a limit on how much savings can be gained from swarming techniques. For example, in the case of 
unpopular content, peers must rely on the publisher in order to complete their downloads. In this paper, we investigate 
such a dependence of peers on a publisher. 



A swarm is a set of peers interested in the same content (file or bundle of files [16]) that exchange blocks of the files 
among themselves. We consider a scenario where each swarm includes one stable publisher that is always online and ready 
to serve content. The corresponding system is henceforth referred to as a hybrid peer-to-peer system, since peers can always 
rely on the publisher if they cannot find blocks of the files among themselves. If all blocks are available among the peers, 
the swarm is referred to as self-sustaining. Quantifying swarm self-sustainability, defined as the fraction of time during 
which the swarm is self-sustaining, is useful for provisioning purposes. The larger the swarm's self-sustainability, the 
lower the dependency of peers on the publisher, and the lower the bandwidth needed by the publisher to serve the peers. 

The primary contribution of this paper is a model to study swarm self-sustainability. We use a two-layer model to 
quantify swarm self-sustainability as a function of the number of blocks in the file, the mean upload capacity of peers 
and the popularity of a file. The upper layer of our model captures how user dynamics evolve over time, while the 
lower layer captures the probability of a given number of blocks being available among the peers conditioned on a fixed 
upper layer population state. Our model is flexible enough to account for large or small numbers of blocks in the file, 
heterogeneous download times for different blocks, and peers residing in the system after completing their downloads. We 
derive closed-form expressions for the distribution of the number of blocks available among the peers and apply them to 
show that self-sustainability increases as a function of the number of blocks in the file. The derived expressions involve 
sums and subtractions of large numbers, and are amenable to numerical errors. Hence, we present an efficient algorithm 
to compute the swarm self-sustainability that avoids these problems. We then numerically investigate the minimum 
popularity needed to attain a given self-sustainability level. Finally, we validate the estimates made by the model against 
detailed simulations. 

The remainder of this paper is organized as follows. After providing a brief background into swarming systems in Sj2j 
in $3] we propose our model. In Sj4] we present an efficient algorithm to solve the proposed model followed by analytical 
results in SJSJ In we evaluate our model against experiments. In ^7] we discuss some limitations and caveats of our 
model, $5] presents the related work and ^concludes the paper. 

2 Swarming Systems Primer 

A swarm is a set of peers concurrently sharing content of common interest. Content might be a file or a bundle of files that 
are distributed together. The content is divided into blocks that peers upload to and download from each other. Since 
there is no interaction between peers across swarms, each swarm can be studied separately. 

Bit Torrent is one of the most popular applications that uses peer-to-peer swarming for content dissemination, and we 
will use it to illustrate how swarming works. Unlike a traditional server-based system, Bit Torrent includes a tracker that 
promotes the interaction of participating peers. The identities of the trackers are announced to peers in torrent files, 
which can be found and downloaded through search engines such as Torrent Finder Q]. Peers periodically query the 
tracker to obtain a random subset of other peers in the swarm in order to exchange (upload and download) blocks with 
them. Peers also discover new neighbors from other peers, in addition to the tracker, when the Peer Exchange (PEX) 
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Figure 1: User dynamics. In stage h, there are rih users, each user owning h blocks, < h < B. 
extension is enabled. 

There are two kinds of peers in the system: seeds and leechers. Seeds are peers that have completed their downloads 
and only upload blocks. Leechers are peers that have not completed their downloads and are actively downloading (and 
uploading) blocks of the hie. Thus, leechers turn into seeds upon completing their downloads. Leechers adopt a tit-for-tat 
incentive strategy while downloading the hie, i.e., leechers preferentially upload content to other leechers that reciprocate 
likewise, and "choke" or ignore leechers that do not reciprocate. 

As many leechers may leave the system immediately after completing the download, content publishers often support 
a stable seed that we refer to as the publisher. A publisher is guaranteed to have all of the blocks constituting the hie. 
In the rest of this paper we assume that each swarm includes one publisher. 

Bit Torrent peers adopt the rarest hrst policy to decide which blocks to download from their neighbors. According 
to the rarest hrst policy, a peer prioritizes the rarest blocks when selecting the ones to download next. We say that 
a peer is interested in another peer if the latter can provide blocks to the former. Since rarest hrst guarantees a high 
diversity of blocks in the system, any peer is almost always interested in any other peer, which in general yields high 
system performance |13j . 

Note, however, that in Bit Torrent peers only have local information about the system. Hence, they can only implement 
a local rarest hrst policy. The intrinsic limit on the number of connections that a user can establish naturally provides 
each of them with only a myopic view of the system. Firewalls, NATs and other exogenous factors may also prevent users 
from establishing connections among themselves. 

3 Model 

In this section, we present our model to estimate swarm self-sustainability. Our model is hierarchical. The upper layer 
characterizes user dynamics, and the lower layer comprises a performance model used to quantify the distribution of blocks 
available among the peers, for a given state of the upper layer modelQ We present each of the layers, in £13.11 and £13.21 
respectively, and then introduce the metric of interest in §3.31 



3.1 User Dynamics Model 

A file consists of B blocks. Requests for a file arrive according to a Poisson process with rate A. We further assume that 
the time required for a user to download its j block is a random variable with mean l/fij, 1 < j < B. After completing 
their downloads, peers remain in the system for a time with mean I/7. 

We model user dynamics with (B + 1) M/G/oo queues in series. Each of the first B M/G/oo queues models the 
download of a block, and capture the self-scaling property of Bit Torrent swarms, i.e., each peer brings one unit of service 
capacity to the system. The last queue captures the residence time of seeds (see Figure [T]). 

The system state is characterized by a (B + l)-tuple, n = (n , ri\, n 2 , . . . , where represents the number of 
customers in queue h, i.e., the number of users that have downloaded h blocks of the file, < h < B. We denote by N the 
random variable characterizing the current state of the upper layer model and by n its realization. The number of peers 
in the system is referred to as n, n = ^2f— Ui - 

Peers arrive according to a Poisson process with rate A to queue and transit from queue h, also referred to as stage h, 
to queue h + 1 (0 < h< B — 1) with rate fih+i, the download rate of the (h + l) th block downloaded by a peer. The mean 
residence time in queue B captures the mean time that peers remain in the system after completing their downloads, 1/7. 
Setting 7 = 00 models the case where all peers immediately leave the system after completing the download. Throughout 
this paper, unless otherwise stated, we assume that the mean download times of all blocks are the same, = 
1 < j < B. Nevertheless, all results are easily extended to the case where the mean time it takes for a user to download 
its j th block is 1/ /ij . 

Let 7r(no, . . . , ns) be the joint steady state population probability distribution, 7r(rto, . . . , ns) = P(N = (no, . . . , ng)), 
of finding users in the h th queue, < h < B, and let TTh(nh) = P(Nh — rih), h = 0, . . . ,B, be the corresponding 
marginal probability. The steady state distribution of the queueing system has the following product form, 



where p = A/fj, is the load of the system (refer to Table [T] for notation). 
3.2 Performance Model For a Given Population State 

We now describe the lower layer of the model. Given the current population state, n = (no, . . ■ , tib), our goal is to 
determine the distribution of the number of blocks available among the peers. We begin by stating our key modeling 
assumption. 

Uniform and independent block allocation: In steady state, the set of blocks owned by a randomly selected 



x Each layer of the model is self-containted. In Appendix [K] we provide an alternate description of our model, integrating the user dynamics 
into the lower layer and explicitly capturing the evolution of the blocks owned by each user (users signatures) in time. 




(1) 




possibilities and independently among users. 



parameters 



A mean arrival rate of peers (peers/s) 

mean time to download a block (s) 
p = A/fi mean load of the system (per stage) 

B number of blocks in file 

I/7 mean residence time of seeds 



variables 
n h 

n=(n 0y . .,n B ) 
7r(n) 

n = Ei=o n * 

V 

metrics 

p{v) = P(V = v) probability of v blocks being available among peers 

A—p(B) swarm self-sustainability 



Table 1: Table of notation. Vectors arc denoted by bold face symbols. Unless otherwise stated, 7 = 00 in which case 
tib = 0. When referring to block availability, it is subsumed availability among peers (excluding publisher). 

A user u in stage h, < h < B, has a signature Sh, u <= {0, defined as a B bit vector where the i th bit is set to 1 if 
the user has block i and otherwise. Each user in stage h owns h blocks and has one of (•?) possible signatures. 

Under the uniform and independent block allocation, signatures are chosen uniformly at random and independently 
among users; the latter is clearly a strong assumption since in any peer-to-peer swarming system the signatures of users 
are correlated. Nevertheless, in we show that the effect of such correlations on our metric of interest, swarm self- 
sustainability, is negligible in many interesting scenarios. Therefore, we proceed with our analysis under such an assump- 
tion. 

Let Sh, u be the random variable denoting the signature of the u th user in stage h, and Sh, u its realization, 1 < u < n. 
The sample space of the lower layer model, f2 n , for a given state of the upper layer, n, is the set of all {0, l} Bn bit vectors 
in which element B(u — 1) + i equals one if the u th user has block i, and zero otherwise, l<u<n, l<i<B. An 
element in f2 n is the concatenation of n bit vectors of size B each. f2 n has cardinality \Q n \ = Y\f l=Q (^) h ■ Then, under 
the uniform and independent block allocation assumption, 



P(Sl,l=S ltl , . . . ,S B ,n B =S B ,n B \N - n ) = jjf| ( 2 ) 

In the next section, we relate the upper and lower layer models, showing how ([1]) and ([2]) yield the key metric of interest, 
namely, swarm self-sustainability. 

3.3 Self-Sustainability 

We now define the key metric of interest, swarm self-sustainability. Let V denote the steady state number of blocks available 
among the peers. Denote by p(v) the steady state probability that v blocks are available among the peers, 



number of users that own h blocks 
upper layer state 

steady state probability of state n 

number of peers in the system 

number of blocks available among the peers 



p( v ) = p(v =v)= p( y = v \ N = n K( n ) 

n6N B + 1 



(3) 



Definition 3.1. The swarm self-sustainability, A, is the steady-state probability that the peers have the entire file, 



A = p(B) (4) 

Definition 13.11 together with equation ([3]) yield, 

A= V P(V = BIN = n)7r(n)= V P(V = BIN = n) (A/7) " B e ' A/7 TT ( ^r^ P ) (5) 

The second equality in §5§ follows from ([T]). P(V = B\N = n) is obtained from ^ (see Appendix |A"|) . 

If peers leave the system immediately after concluding their downloads, we refer to the swarm self-sustainability as 
Aoo. The swarm self-sustainability, A, when 7 < 00, is obtained from A x as follows, 

A=l-(l-^ 00 )exp(-A/7), 7<°o (6) 

The above follows because a block is unavailable among the peers if it is unavailable among the leechers and there are no 
seeds in the system. 

Note that A is expressed through ([3]) as an infinite sum. In what follows, we approximate A by its truncated version, 
A^ N \ considering only population states in which there are no more than N users in the system, 

A (N) = J2 P(V - B\N = n)7r(n) (7) 

neti B s.t.n<N 

The value of N is based on the desired error tolerance 77, and is chosen as described at the end of |JH A naive use of (0 
yields an inefficient algorithm to compute A^ by exploring a number of states that grows exponentially with respect to 
the file size. This problem is addressed in the next section, where we provide an efficient algorithm to evaluate A^ N \ 

In the rest of this paper, we refer to the truncated self-sustainability, A^ N \ simply as self-sustainability, the distinction 
between A^^ and A being clear from the context. In addition, since A is readily obtained from Aoo using ([S]), henceforth 
we focus on the case 7 = 00 and make the dependence of p(v) on 7 explicit, denoting it by p(v; 7), whenever 7 < 00. 

4 An Efficient Solution Algorithm to Evaluate Self-Sustainability 

In this section we present an efficient algorithm to compute swarm self-sustainability in polynomial time. The key 
insight consists of aggregating the states in the upper layer of the model in such a way that the lower layer metrics are 
computed once per aggregate rather than once per state. The algorithm relies on three observations about our model, the 
first related to the performance model for a given population state (lower layer model) and the last two related to the user 
dynamics (upper layer model). 

Let iph(k,v) be the probability that, in a system in which k blocks are initially available among peers, v blocks 




t^x-x^s^ blocks initially available among peers 
Y///////////A blocks initially unavailable among peers 
contributed blocks 



Figure 2: Recursion to compute probability of v blocks being unavailable among the peers. There are initially k blocks 
unavailable among the peers, and v after a user contributes h blocks. 

become available among the peers after an additional user contributes h blocks. Then, tph{k,v) is characterized by a 
hypergeometric distribution, 

t k \ m-k\ 

Mk,v) = , k = 0,l,...,B-l,B, v = max(k,h),...,B-l,B (8) 

Equation ([8]) follows because there are (zZk) wa Y s m which v — k blocks of the additional user do not overlap with the k 
previously available blocks, and there are { h _, v _ k \) ways in which the other h — v + k blocks can overlap with previously 
available blocks (see Figure [2]). A recursion to compute iph(k,v) is presented in Appendix IB"1 

Our second observation regards the steady state probability that a randomly selected user is in stage h, denoted by 
<j(h) (0 < h < B — 1). It can be shown that <r(h) = 1/B, < h < B — 1. This is a consequence of the assumption that 
the download times of all blocks have the same mean, Note that, in general, if a user downloads its (h + l) th block 

at rate Hh+i, < h < B — 1, the model can be easily parameterized by setting o(h) = (l/M/i+i)/(SiLi V/ 1 *)- 

Our third and last observation concerns the total number of users in the system. The total population is characterized 
by a Poisson random variable with mean _BA//i. This follows from the fact that the sum of B independent Poisson random 
variables, with mean A//x, is a Poisson random variable with mean BX/fj,. 

Denote by p n {v) the probability that v blocks are available among the peers conditioned on the presence of n users in 
the system, 

Pn(v) = P(V = v\\n\=n) (9) 

It follows from the discussion in the previous paragraph that p(v) — Yl^=oPn{ v ) e Bp {Bp) n /n\. The truncated version of 
p(v), p {N} (v), is 

JV 

pW(v) = Y,Pn(v)e- Bp (Bp) n /n\ (10) 

n=0 

It remains to show how to compute p n (v). This is accomplished by making use of our first two observations, as summarized 
by the following lemma. 



Lemma 4.1. The probability of v available blocks, conditioned on n users in the system, p n {v), satisfies the following 



recursion, 



min(t;,S — 1) y 

Y Y Pn-i(k)Mk,v)/B, n>l 



Pn{v)= < 



h—0 k—v — h 



(ii) 



n = 0,v = 
n = 0,v^O 

In Appendix [Cl we show that (ITT1) correctly computes (|9]). Next, we further simplify recursion (jlip . Changing the 
order of the summations and adapting their limits accordingly yields 



.W^^P^iW ^ $ h (le,v)/B, n>l 



k=0 



h—v — k 



The base cases are po(v) — l if u = and po(v)=0 if m ^ 0. In Appendix IDl we derive the following result, 



(12) 



B-l 



(B + l)/(B-k + l), 0<m<B-l 
k/(B-k + l), v = B 



Y ^ h ( k ' 



Equation (fT3|) is key to further simplifying (fl"2j) . Replacing (fT3|) into (fT2|) yields, after algebraic manipulation, 
Theorem 4.1. The probability of v blocks being available among the peers, p{v), equals 



(13) 



p(v) = YPn(v)e- Bp (B P rM 



(14) 



where p n {v) satisfies the following recursion, < v < B , 



Pn{v)= < 



1/B", n>l,« = 

Pn(« - 1) +p n - 1 (v)((B + 1)/{B(B -v+ 1))), n > 1,B > v > 

1, n = 0,v = 

0, n = 0, v ^ 



(15) 



and p n {B) = 1 — X^Lo* Pn{ v )- The approximation p^ N \v) for (|14l) is obtained by truncating the infinite sum at N and is 
computed in time 0{NB). 

Theorem 14.11 yields an efficient algorithm to evaluate swarm self-sustainability. The algorithm has complexity 
O(NB), since p n (v) is computed for < n < N and < m < B. Note also that once the elements p n {v) are computed 
for a fixed B, one can obtain the self-sustainability for different values of p in time O(B). 

Let e(N) be the truncation error, e(N) = p{B) — p^ N \B). The maximum number of users in the system, N, can be 
chosen as a function of e(N), 



N 



e(N)= Y p{v)e- Bp {Bp) n /n\< Y eT Bp {Bp) n /n\ = 1 - Y e' Bp {Bp) n /n\ 

n=N+l n =N+l ri=0 



(16) 



If pB is large (pB > 1000), the Poisson distribution is well approximated by a normal distribution. In this case, N can be 
chosen so that 1 — $((iV — Bp)j\JBp) < 77, where 77 is the desired error tolerance and $(■) is the standard normal cdf. 

Theorem 14.11 assumes ^t/, = fj,, < h < B — 1. If that is not the case, self-sustainability can be computed in time 
0(NB log B) using an alternative recursion. We refer the reader to Appendix [Gl for details. 



5 Model Analysis 

We derive closed-form expressions for the probability that v blocks are available among the peers in the system and for 
the mean number of available blocks, in H5.ll and M5.21 respectively. The closed-form expressions are useful in order 
to gain insight on how different system parameters impact self-sustainability. In !j5.2l we use the closed- form expressions 
to compute the minimum popularity to attain a given self-sustainability level. In we show that self-sustainability 
increases with file size. However, the closed-form expressions may lead to numerical problems, if used to compute the 
self-sustainability for large files (e.g., B > 500), since they involve sums and subtractions of large numbers. This is why 
the recursion presented in <0]is useful. 

In order to simplify the closed- form expressions, in the remainder of this section we assume that 7 = /1, i.e., peers, 
after completing their downloads, linger in the system as seeds for an interval with duration drawn from an exponential 
distribution with mean 1/7. Recall that when 7 is finite, we make the dependence of p n (v) on 7 explicit, and denote it by 
Pn(v;j). 

5.1 Self-Sustainability Closed-Form Expression 

Similar arguments to those in S}4] (Theorem 14. 1[) yield, for 7 = p, and 0<n<N,0<v<B, 



Pn(v;p)= < 



1/(5+1)™, n> l,t! = 

Pn(v- l;jJ,)+p n -i(v;fj,)(l/{B -v + 1)), n>l,0<v<B 

1, n = 0,u = 

0, n = 0, v ^ 



(17) 



This recursion can be solved (see Appendix |H|) . to obtain 

p n ( U ;/i)=( jB )^^)(-iy(S- U + Z + l)- n , l<n, 0<v<B (18) 



In particular, the probability that all blocks arc available among the peers, conditioned on the number of users in the system, 
is p n (B;p) — + 1) — Using this expression we derive, in Appendix U the corresponding unconditional 



2 The expression corresponding to equation H18H for the case 7 = 00 is found in Appendix [F] 



probability, namely, the swarm self-sustainability, 

p(B; M ) = f; f f ) (_l)' e -(B+i)pVa+i) = 1 - £ (f ) (-l^e-^^/Cm) (19) 

We now interpret (|19[) using the inclusion/exclusion principle, which allows us to apply the Bonferroni inequalities in H5.2I 
and £15.31 The term exp(— (£? + l)pl/(l + 1)) is the probability that I specific (tagged) blocks are unavailable among 
the peers (refer to Appendix [J] for the derivation). So, f?) exp(— (£> + \)pl/(l + 1)) is the sum of the probabilities that 
any I blocks are unavailable among the peers. Therefore, as a consequence of the inclusion/exclusion principle, the 
probability that at least one block is unavailable among the peers equals the rightmost summation in (|19|) . and p{B; fj,) 
is its complement. 

In what follows, we use the above closed-form expression to analyze the mean number of blocks unavailable among the 
peers as well as the impact of the file size on the self-sustainability. 

5.2 Minimum Load to Attain Self-Sustainability 

We now provide a simple expression to estimate the minimum load necessary to attain high self-sustainability. The result 
relies on approximating swarm self-sustainability using the mean number of available blocks among the peers. The mean 
number of available blocks among the peers, E[V], is E[V] — B(l — q), where q is the probability that a tagged block is 
unavailable among the peers, 

? = eaq>(-p(B + l)/2) (20) 

The expression of q is readily obtained from (|19|) . Note that the mean number of unavailable blocks, Bq, equals the 
first term (I = 1) in the rightmost summation in (|19p . This observation coupled with an application of the Bonferroni 
inequality [33] to (JTUJ) implies that 1 - p(B; p) < B - E[V}. When E[V] ss B, the upper bound B - E[V] provides an 
approximation to the fraction of time that the swarm is not self-sustaining, 1 — p(B; p). 

Next, we present a simple alternative derivation of (j2T)]) . Let be the probability that a tagged block is unavailable 
among leechers (excluding seeds). In order to compute <7oo, note that the mean time that a leecher holds a tagged block 
is Yld=o l/i^B) — (B — l)/(2/i) and the rate at which leechers acquire a tagged block is given by a Poisson process with 
mean rate A. Therefore, q^ = exp(— p(B — l)/2). In general, a tagged block is unavailable among the peers if no 
leecher owns the block and there are no seeds in the system. The probabilities of these two events are exp(— p(B — l)/2) 
and exp(— p), respectively. Their product yields (|20p. 

We now use the above results to provide a simple expression to estimate the minimum load, p* , necessary to attain a 
given self-sustainability level, A*, when 7 = 00. It follows from the discussion in the previous paragraph that if 7 = 00 
the probability that a tagged block is unavailable among the peers is qoc — exp(—p(B — l)/2). For values of qoc close to 
(<7oo < 0.01), p(B) « 1 + E[V] — B = 1 — -B<7oo, as indicated in the beginning of this section. This approximation, in 



turn, can be used to select the load p* to attain self-sustainability level A*, 



p*tt[21ag(B/(l-A*))]/(B-l), 7 = ^ (21) 

We further study (|21|) in i)6.31 where we compare the results obtained with this approximation against those obtained with 
recursion (fT5|) , that provides exact results. 

5.3 The Impact of File Size on Self-Sustainability 

Increasing file size, B, increases the mean download time of peers, B / p. In this section, we show that such an increase 
in the residence time of peers yields larger swarm self-sustainability. Theorem 1 5 . 1 1 states the result for 7 = p. B > 4 
and p > 1.6 and in jJHwe provide evidence that it also holds when 7 = 00 and for small values of p. 

Theorem 5.1 (File Size Impact). If B > A, p > 1.6 and 7 = p, self-sustainability increases with the file size, B. 

Proof. We denote the swarm self-sustainability for a given value of B and p as p(B, p). The Bonferroni inequalities [33], 
which generalize the inclusion/exclusion principle, applied to (|19[) . yield upper and lower bounds onp(B,p), 

1 - Be-^+iW 2 < p(B, p)<l- Be-^+^l 2 + (B(B - 1) / 2 ) e - 2 ( B + 1 )"/ 3 (22) 

It is easy to show that if p > 1.6 and B > 4 then I- Be- {B+1 ^/ 2 + {B{B - l)/2) e -( B+1 ^ 2 / 3 < 1 - (B + l) e - ( - B + 2 ^/ 2 , 
from which the result follows by comparing p(B, p) and p(B + 1, p). □ □ 

The key insight of Theorem I5.1l can be easily explained in terms of the busy periods of the proposed model. The busy 
period is defined as an uninterrupted interval during which the swarm is self-sustaining. As the file size increases, the num- 
ber of blocks that need to be maintained increases linearly but the busy period of the system increases exponentially |16j . 
Indeed, as the file size increases the availability gain compensates the overhead to maintain a larger number of blocks, and 
the self-sustainability increases. 

6 Evaluation 

In this section we report (a) a validation of the proposed model, against detailed simulations, showing that despite 
the simplifying assumptions considered in our model, it captures how self-sustainability depends on different system 
parameters and (b) results on the minimum popularity to attain a given self-sustainability level. 

6.1 Experimental Setup 

Our simulation experiments were conducted using the Tangram-II modeling environment [8]. Tangram-II is an event- 
driven, object oriented modeling tool. The three main objects in our simulations are the tracker, the peer and the 



seed. Their implementations are based on the official Bit Torrent protocol description [6, 13 . Every time a peer enters 
the system, receives a block or leaves, we record the event in our logs, the current timestamp, peer id, and signature (see 

6.1.1 Simulator and Protocol Descriptions 

When a peer P joins the system, it receives a random list of fifty other peers from the tracker, which constitutes its peer 
set. Throughout the simulation, as peers leave the system the size of the peer set of P may dwindle to less than twenty. 
Once the peer set size is less than twenty, P requests additional neighbors from the tracker. The set of peers to whom P 
offers content blocks is a subset of its peer set, referred to as the active peer set. 

BitTorrent proceeds in rounds of ten seconds. By the end of each round, peer P runs the tit-for-tat incentive mechanism. 
According to this mechanism, P reciprocates contents with those neighbors that contributed in the previous round. P 
selects r of those peers that contributed in the last round to add to its active peer set (r < 4). In the next round, the 
active peer set of P will consist of the r aforementioned peers plus 5 — r additional peers selected uniformly at random 
out of its peer set. This random selection of peers is referred to as optimistic unchoke, performed to allow peers to get 
bootstrapped as well as to let them learn about new neighbors. Finally, peers select blocks to download using the rarest 
first algorithm, except for the first four blocks, which are chosen uniformly. Each block of the file is divided into sixteen 
sub-blocks. After selecting the blocks to download, peers can get different sub-blocks (of the same block) from different 
neighbors concurrently. A block can be uploaded after all its sub-blocks are downloaded, assembled and checked using the 
block hash key. 

In our experiments we observe that self-sustainability decreases with the size of the active peer sets. That is because, 
if the active peer sets are large, each peer splits its bandwidth across many other peers and blocks take longer to get 
replicated in the network. As mentioned above, we select an active peer set of size five, which is adopted by many 
BitTorrent implementations. 

In our experiments the seed behaves as a standard peer, except that it (i) initially owns all blocks and (ii) is altruistic, 
hence does not execute the tit-for-tat algorithm. We did not implement the mechanism used by peers to download 
their last block, also known as end game mode [6]. This is inconsequential, though, since the end game mode does not 
significantly affect the steady state behavior of the system (see §6. 2. H and [4]). 

6.1.2 Experimental Parameters 

The configuration of our experiments consists of torrents that publish a file of size S divided into B blocks of size s, 
s = 256KB, a typical block size in BitTorrent. The number of blocks in the file, B, takes values 16, 50, 100 and 200, which 
corresponds to files of size 4MB, 12MB, 25MB and 51MB, respectively. Note that if a swarm is constituted of multiple 
files which can be downloaded separately, we arc interested in analyzing the self-sustainability of each individual file. A 
file of size 51MB already yields self-sustainability larger than 0.9 for A > 0.05 peers/min (see Figured]). Simulations to 

3 Our Tangram-II model as well as the traces generated for this paper are available at http://www.land.ufrj . br/~arocha/self sustain 



analyze such a steep increase in self-sustainability (see Figure |4|) quickly requires prohibitively large execution times and 
significant variability in the metrics of interest across runs. For this reason, we focused on quantitatively validating our 
model for files with up to 200 blocks, but also use the model to analyze files of size greater than 200 blocks. 

The uplink capacity of each peer is 39KBps, which corresponds to /i = 39/256 = 0.15 blocks/s, a typical effective 
capacity for Bit Torrent peers |23l Figure 1]. The publisher maximum upload capacity is the same as that of a peer. 
A publisher that contributes the same capacity as an ordinary peer might correspond to either a domestic user or a 
commercial publisher that supports a large catalog of titles and only provides enough capacity for each swarm so as to 
allow peers to complete their downloads at rate /i. The peer arrival rate is varied according to the experimental goals 
between (.25, .5, 1, 2, . . . , 9) peers/min, as described next. 

6.2 Model Validation 

Our analytical model makes a number of simplifying assumptions as described in Ej3j In what follows, our goal is to 
show that even with those simplifying assumptions, discussed in £)6.2.11 our model still captures swarm self-sustainability 
in a realistic setting, as shown in H6.2.21 

6.2.1 Validating Model Assumptions 

Our aim in this section is to validate (a) that the mean download times of blocks are roughly the same (an exception being 
the first block downloaded by the peers), i.e., /i^ = n, 2 < h < B (see £13. ip and (b) that the signatures of the users are 
uniformly distributed (see £ 33. 2j) . 

Figure [3ja) shows the download time of the h th block downloaded by a peer. The boxplots and lines show the 
distribution quartiles and minimum and maximum values. Crosses indicate means download times of blocks, which are 
approximately the same, except for the first and last blocks. In particular, even though our simulator ignores the 
end game mode [4], in general peers do not experience difficulty finding a neighbor from whom to download their last 
block. The median of the last block is roughly the same as the one observed for the other blocks, and the mean is 
only slightly larger. The first block requested by a peer, however, takes longer to be downloaded. This happens because 
a peer can only download its first block after being optimistically unchoked (see £16. 1 . 1|) . Although this affects the time 
spent by peers in stage zero of our model and as a consequence the total download time, it is inconsequential to our 
self-sustainability estimates (the time that peers remain in stage zero of the upper layer model has no influence in our 
results). While Bit Torrent peers download their first block, they cannot contribute to self-sustainability as they have no 
content to provide (see §6.1.ip . 

Our second goal is to study the users' signatures distribution ( §3.2|) . For this purpose, we validated that the first two 
and last two blocks downloaded by a user are indeed uniform and then studied one of the consequences of the uniform 
and independence assumption, namely, that the number of replicas of each block in the system is well balanced. 
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Figure 3: (a) Download times of h th block downloaded by a peer. The boxplots show the four quartiles and crosses 
indicate means. (b) Top: shaded (resp., light) rectangles are fractions of peers that download block x as their first 
(resp., last) block. Bottom: fraction of peers that download block-pair (x\,x%) as their last two blocks, (c) Mean number 
of replicas of each block with 95% confidence intervals, (d) Coefficient of variation of number of replicas of blocks versus 
time. 



Figure ^b) (top) shows, for each block, the fraction of peers that downloaded that block as their first block (shaded 
bars). The figure was generated from independent samples: every 500 seconds, one user owning one block was randomly 
selected, and the identity of its block was recorded (the same procedure was repeated for users owning all but one block 
of the file (light bars)). Similarly, Figure [3th ) (bottom) shows, for every B(B — 1) block-pair, the fraction of peers that 
downloaded that pair as their first two blocks. Figure EJb) indicates that the first and last blocks downloaded by users, 
as well as the first downloaded block-pair, are approximately uniformly distributed (the same procedure was repeated for 
users owning all but two blocks, with similar results). 

Figure [3jc) shows the mean number of replicas of each block (including the one stored at the publisher) for A = 
1 peer/min, 4 peers/min and 7 peers/min. The mean number of replicas of each block is around p{B — l)/2 + 1, which 
corresponds to a well balanced system (see £ i5.2j) . Figure EJd) corroborates this claim by showing the coefficient of variation 
of the number of replicas of blocks as a function of time. Let r%t be the number of replicas of block i at time t. The mean 



number of replicas of blocks at time t is /X{ = Y2f=i r i,t I B an d the coefficient of variation is ct — y (22=i ( r *>* — ^t) 2 / B) / ^t- 
Figure ISJd) indicates that throughout the simulation, the coefficient of variation is usually smaller than 0.8, which means 
that the number of replicas of blocks has a low variance. In a system where users signatures are uniform and independent 
we would observe similar behavior. 
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Figure 4: Model validation. Swarm self sustainability as a function of the system load. Results obtained with recursive 
algorithm and simulations (with 95% confidence intervals) are plotted with dotted and solid lines, respectively. 

6.2.2 Validating Model Estimate of Self- Sustainability 

To study how content popularity impacts self-sustainability, we simulated Bit Torrent in the setting described in §6.1.21 
varying the arrival rate of peers, A, from 1 peer/minute to 8 peers/minute, in increments of 1 peer/minute, while main- 
taining all other parameters fixed. Equivalently, this corresponds to an increase in the load, p — Xs/fi, from 0.1 to 0.8. 
Each simulation lasted for 10,000s. Twenty one independent simulations were executed for each value of A, and used to 
compute 95% confidence intervals. The same experiment is repeated for B = 16, 50, 100 and 200. 

Figure |4] shows self-sustainability, A, as a function of the content popularity, A, for B = 16, 50, 100 and 200. For 
unpopular contents, A = 1 peer/min, and small files, B = 16, swarm self-sustainability is around 0.1 and the publisher 
needs to frequently provide blocks that are unavailable among the peers. As the popularity of the files increases, swarm 
self-sustainability increases and content is available even in the absence of the publisher. For A = 8 peers/min, the fraction 
of time at which the publisher needs to provide blocks to peers is close to zero. 

Figure 0] indicates that the results obtained with our model are close to those obtained through simulation. Even 
assuming that the mean download times of all blocks are the same (fih = fj,, for 1 < h < B), the model was able to 
capture the self-sustainability observed in our simulations. We also repeated the simulations with heterogeneous peer 
upload capacities, for B ^ 16 (the upload rate distribution taken from the measured data used to generate Figure 1 in the 
Bit Tyrant study [23]) and our results did not qualitatively change (details in |17|). 

Consider now the impact of file size on swarm self-sustainability. Figure [4] shows that for a fixed content popularity, as 
file size increases, self-sustainability increases. This is in accordance with Theorem l5.ll and reflects the fact that, as file size 
increases, peers stay longer in the system and the coverage, defined as the mean number of users in the system, increases. 
The higher the coverage, the greater the self-sustainability of the swarm. In fact, as file size increases, the number of blocks 
that needs to be maintained by the publisher increases linearly but the availability gain increases exponentially [16] . 
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Figure 5: Larger files yield increased availability, (a) x-axis, file size, y-axis, necessary load (p* = X/p) to attain self- 
sustainability greater than 0.9 (red) and 0.999 (blue). The results obtained using the approximation ([21]) are also shown. 
(b) the mean number of users in the system, Bp* , to attain a desired self-sustainability level. 



6.3 Popularity to Attain High Self-Sustainability 

We now address the following question: what is the minimum file popularity (or load) needed to attain a given self- 
sustainability? Answering this question is useful not only for publisher dimensioning but also for other strategic decisions 
such as how to distribute and bundle files across multiple swarms [16] . 

Recall that the load is defined as p = X/p. Figure [5] shows the minimum load, p*, necessary to achieve high self- 
sustainability, A* (A* = 0.9,0.999), for file sizes varying between 2MB and 256MB (B = 8, . . . , 1, 000). This figure 
illustrates self-sustainability as predicted by the recursions contained in Theorem l4.11 eq. (|15[) and the approximation (1211) . 
Figure OJa) suggests that when the goal is to find the minimum popularity to attain a high self-sustainability level, 
equation ()21|) can be used to approximate p*. However, if the goal is to compute self-sustainability under different loads 
and for different files sizes, as illustrated by Figure [4j the recursion provided by Theorem 14.11 needs to be used. 

Figure EJa) indicates that the popularity, p* ', needed to attain a high degree of self-sustainability increases as the file 
size, B, decreases. In particular, the zoom in the figure shows that log p* is linear in log B. The comments made at the end 
of §6.21 to explain Figure [4] also apply here. Peers take longer to download larger files, which increases block availability. 
Nevertheless, the benefits of leveraging peer-to-peer swarming can be noted even for small files. Figure [5^a) shows that 
for a file of 4MB (B = 16), an arrival rate of 8 peers/min (which corresponds to a load of 0.8) already yields a very high 
self-sustainability. 

More insights on how file size impacts self-sustainability are obtained from Figure[5jb). Figure[5jb) plots the mean num- 
ber of users in the system, also referred to as the mean coverage [TH] , necessary to attain a high level of self-sustainability. 
The curves in Figure EJb) correspond to the the respective ones in Figure Efa) multiplied by B. The main insight shown 
in this example is that the coverage necessary to attain a given self-sustainability level slowly increases as a function of 
B. As the file size increases, a slightly larger population suffices to attain a given self-sustainability level. For instance, 
for a file of size 10 a coverage of 20 is necessary to attain self-sustainability of 0.999, whereas a coverage of 29 suffices to 



achieve the same self-sustainability if the file has 1,000 blocks. 

7 Discussion 

Next, we discuss the assumptions adopted to yield a tractable model. 

Uniformity and independence assumptions: In the performance model presented in £13.21 we assume that the 
signatures of users are drawn uniformly and independently at random. In particular, we do not account for correlations 
among users' signatures. Although such correlations are present in practice, our simulations have indicated that the 
independence assumption is appropriate in order to capture the self-sustainability of swarms. 

In the user dynamic model presented in Ej3.ll we make the following assumptions: (i) peers arrive according to a 
Poisson process with rate A (steady state assumption), (ii) the download times of all blocks have the same mean, 1/fj, 
(smooth download assumption) and (m) users leave the system immediately after completing their downloads, 7 = 00 
(self-regarding users assumption). We discuss each of these in turn. 

Steady state assumption: It has been shown in [161 Section 4.3.4] that a vast number of long-lived swarms have 
relatively stable mean arrival rates over periods of months. Our model can be used to predict the self-sustainability of 
such swarms. 

Smooth download assumption: Under this assumption, the capacity of the system scales perfectly with the 
number of users. Our simulations indicate that the mean download time of the first and last blocks are slightly larger than 
the others. Although our model has the flexibility to capture such asymmetries (see observation two in Q, we show that 
their implications in the estimates of self-sustainability are not significant (see £16.2.11) . 

Self-regarding users assumption: Our model has the flexibility to account for users that stay in the network after 
completing their downloads. However, in today's Bit Torrent users have no incentive to stay in the system after obtaining 
the files of their interest. Therefore, we focus on the worst case scenario in which users, not having incentives to linger in 
the system after completing their downloads, depart immediately. 

Finally, in our simulations we consider a publisher that is always online and that behaves like a typical peer. 
Typical peer-like publisher: Our simulations indicate that if the publisher has the same capacity as a typical peer, 
the smooth download assumption holds and the swarm self-sustainability estimates of our model are accurate. Coping with 
intermittent publishers and devising dynamic bandwidth allocation strategies according to which the smooth download 
assumption holds is non trivial |10j . and is subject of future work. 




8 Related Work and Discussion 

Modeling of Peer-to-Peer Swarming Systems 

The literature on availability [20], performance [25] and incentive issues [23^ in Bit Torrent-like swarming systems is vast. 
Nevertheless, to our knowledge this paper presents the first analytical model for publisher dependency estimation. We 




are unaware of related analytical work that analyzed swarm self-sustainability taking into account the fact that files are 
divided into multiple blocks, a very fundamental characteristic of these systems. 

For large populations, Massoulie and Vojnovic [TS] used a coupon collector model to show that asymptotically the 
distribution of blocks across the population is well balanced, and does not critically depend on the block selection algorithm 
used by the peers. Qiu and Srikant [35] and Fan et al. [5] also considered the large population regime, and used fluid 
approximations and differential equations to model the system assuming that the efficiency is always high. In this paper 
we are particularly interested in the small population regime. For small populations, Markov Chain (MC) models have 
been proposed by Yang and Veciana [36], providing insights on the performance of the system but not dealing with the 
problem of block availability among peers. Norros and Reittu [26 , using a different model, studied the dissemination of a 
two-block file in a closed network accounting for the availability of the blocks. In this paper, we consider an open network 
and propose a model which can be used to estimate self-sustainability of files of arbitrary size. 

In this work we studied the implications of the content popularity on the self-sustainability of swarms. In a real 
time setting, Leskela et al. [14] pointed out a phase transition in the stability of the peer-to-peer system as a function of 
the content popularity. In contrast, our system is always stable. Norros et al. [21) imply a phase transition of the mean 
broadcast times as a function of the departure rate of seeds. In this paper we are concerned with self-sustainability. 

Hybrid Peer-to-Peer Swarming Systems 

The literature on the use of peer-to-peer swarming systems for enterprise content delivery is rapidly growing [Tl l22l[2"T] . 
The methodology usually consists of defining an optimization problem to be solved by the publishers and then showing 
how different system parameters affect the optimal bandwidth allocation strategy. The approach we take in this paper is 
different. We are interested in the minimum fraction of time that the publisher must be active so as to guarantee that 
all blocks are always available. 

Ioannidis and Marbach study how quickly the bandwidth available at the server has to grow as the number of users 
increases. For this purpose they consider two query propagation mechanisms, the random walk and the expanding ring. 
Here, on the other hand, we assume that peers can always find the blocks they need in case they are available. While [TT] 
focuses on the control plane and it's asymptotic analysis, here we focus on the data plane and account also for small files. 

Mcnasche et al. [16], Wong et al. [35] and Susitaival et al. [30] propose models for content availability in Bit Torrent 
without accounting for the fact that files are divided into blocks. Our model differs from [16l[30l[35] in that we (a) 
consider a hybrid peer-to-peer system, in which a publisher is always available and (b) account for the fact that the file is 
divided into blocks. Finally, explicit scheduling of blocks exchanges to minimize peer download times was studied by 
Mundinger et al. |19j . In this paper we assume that peers exchange blocks using only local information, as in Bit Torrent. 



Balls and Bins 

The derivation of some of our results fit into the balls and bins framework. Each user is allocated a set of blocks (balls) 
each of which must correspond to a different identifier (bin). In the context of balls and bins, a set of balls each of which 
must be allocated in a different bin is referred to as complex |12j . Previous work on the allocation of complexes into bins 
appears in Mirakhmedov and Mirakhmedov [18], Kolchin et al. [HI Chapter VII], and references therein. In particular, 
the definition of iph{i,m) in this paper was inspired by 5', Figure 1]. 

In a peer-to-peer setting, balls and bins were used by Simatos et al. [29] to study the duration of the regime during 
which the system is saturated because capacity is smaller than demand. The scenario studied in this paper differs from 
[2"9"] in several aspects. For instance, [33] considers a finite population of peers. 

9 Conclusion 

Peer-to-peer swarming systems are a powerful tool for content delivery, as reflected by the immense popularity of Bit Torrent 
and the vast literature on the topic. However, most works in this area have focused on the dissemination of popular content, 
for which peer-to-peer systems are naturally suitable. In this work, we investigate the dependency of peers on a publisher 
that leverages peer-to-peer techniques for the dissemination of both popular and unpopular content. In particular, the 
latter deserve special attention, since unpopular content can represent a significant fraction of demand and revenue [3] . We 
believe that devising strategies for disseminating large catalogs of files leveraging peer-to-peer techniques is an important 
and interesting research area, and we see our model as a first attempt to shed light into the intrinsic advantages and 
limitations of peer-to-peer swarming systems for the dissemination of such catalogs. 
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A An Expression of P(V = B\N = n) 

We now show how to obtain P(V = B\N = n) from ([2]). Recall that, given an upper layer state n, f2 n is the lower layer 
sample space. Denote by S the random variable that represents the lower layer state, and by s its realization. Then 

P(y = B|N = n)= ^(S = s|N = n)i ^ l/|fi n | (23) 

Therefore, the problem of computing P(V = B\N = n) is reduced to that of counting the states in which all blocks are avail- 
able among the peers. Using the inclusion/exclusion principle, P(V = -B|N = n) = |ri n | _1 ^f^^ ) Il^Ti 1 ^J 1 )™ 3 ■ 
In general, P(V — v |N — n) is also obtained using the inclusion/exclusion principle [34l Section 4.2]. 

B Recursion to Compute i/jh(k,v) 

Consider the scenario where k blocks are available among the peers and an additional user contributes h blocks. ^phik,v) 
is the probability that v blocks are available among the peers after accounting for the blocks contributed by the additional 
user. iph(k,v) can be recursively computed, 

^ h {k, v) = V>fe-i(fc, v - 1) + i>h-i{k, v) B * hl , 0<k,v<B, 0<h<B (24) 

The base cases are ipo(k,v) = if v ^ k and ipo(k,v) = 1 if v = k. Note that the above recursion is convenient to avoid 
numerical problems since it only involves additions and multiplications of probabilities. 

For presentation convenience, we consider an arbitrary ordering of the h blocks contributed by the additional user. 
After contributing the first h — 1 blocks, there are two cases to consider, (i) v blocks are available among the peers [and 
the h th block overlaps with a previously available block, an event which happens with probability v/(B—h+l)] or (ii) 



v 



— 1 blocks are available [and the h block does not overlap with previously available blocks, an event which happens 



with probability (B—v—h)/(B—h+l)]. Cases (i) and (ii) correspond to the first and second terms in (jM)) . respectively. 

C Proof of Lemma 14.11 

We now show that (|11[) and © are equivalent. To this purpose, we consider an extended representation of the upper layer 
states. In such representation, the upper layer state, n', is defined as follows. 



Consider the users in the system ordered uniformly at random. The number of blocks owned by the i th user is denoted 
by nj. The upper layer state is represented by vector n'. The dimension of n' is 1 x n, where n is the number of users in 
the system. 

Note that n is inferred from n'. Let n;(n') be the number of users that have I blocks when the state is n'. Since any 
permutation of the users in the system is equally likely, the steady state probability of state n' is 7r(n'), 



B-l 



Tr(n') = n 



1=0 



n™i( n ') 

^ e~ p 



1 



ni(n')\ 

The steady state probability of state n', conditioned on the event that there are n users in the system, is 



Tr(n') 



Tr(n') 



1 



Evn':|n'|=„^( n ') e- B P{B P y/n\ B n Hfj 1 ni(n') 



(25) 



The first equality follows from the fact that a superposition of B Poisson processes with rate p is a Poisson process with 
rate Bp. 

The key idea of the proof consists of partitioning the state space into sets G/ 1 (n), < h < B — 1, < n. Set Gh(n) 
contains states in which there are n users in the system and n' n = h. Gh{n) is defined as Ghin) = {n' : n' n = h and < 
n'j < B — 1, 1 < j < n}. The set containing all states in which there are n users in the system, G(n), is 



G(n) = U^G/,(n) = {n' : < < B - 1, 1 < j < n} 
Note that Gft(n) is obtained from G(n — 1) by adding to each element of G(n — 1) a user that owns h blocks, 



(26) 



G h (n) = {(n', 0) + hl n : n' € G{n - 1)} 



(27) 



l n denotes a 1 x n vector in which all elements equal zero, except element n, which equals one. Let p(v\n!) be the 
probability of v blocks being available among the peers when the upper layer state is n'. If n' e Gh(n) (n > 0), 



p(v\ri) = ^2p(k\ri - hl n )i) h (k,v), n' e G h (n) 



k=0 



According to definition 



(28) 



Pn{v)-- 



n'eG(n) 



n'eG(n) 



E p( v \ n ') 



n'GG(n) 

where the first and second equalities follow from (J2HJ) and (|2"5j) . respectively. Hence, 



B-Uto'niin'y. 



(29) 



B-l 



B-l 



°n(v) = E E PM n, ) RnTTB - 1 i , m, = E E 5^p(*|n'-W„)V»fc(fc,t;) 

ll/=0 n H n J- ft=0 n'GG h (n) fc=0 



h=0 n'eG ft (n) 



(30) 



B-l B , , B-l B 



h=0 fe=Om'£G(n-l) 1 li=o ""Z " fc=0 fc=0 



p„-i(fc) 

where the first, second and third equalities follow from (|26p . (|28p and ([27]), respectively. Note that the summands in (j3"Tj) 

for which h > v or k £ [0, i> — ft, — 1] U [w + 1, B] are equal to zero, since in these cases iph(k, v) = 0. Therefore, (|3"Tj) 
yields dTTJ. 



D Probabilistic Derivation of ( TT3I ) 



Next, we provide a probabilistic derivation of (|13p . An algebraic proof can be obtained using the Sigma package [28] . 
applying a paradigm called creative telescoping [37] . 

Henceforth, we consider the case v < B (the case v = B follows similarly). The probabilistic interpretation for (fT3")) 
follows from the connection between ijj^ik^v) and the hypergeometric distribution. Suppose we have an urn containing 
B balls, B — k of which are white (unavailable blocks) and k are black (available blocks). ifjh(k,v) is the probability of 
selecting without replacement h balls, of which v — k < h are white. 

Consider now another experiment, namely selecting without replacement all balls from the urn. Let J w be the round 
in which the w th white ball is selected, < J\ < J2 < ... < Js-k < B + 1. Let Wb be the number of black balls 
selected between the b th and b + 1 th white, plus one. Equivalently, Wb is the number of elements in the set S — {n £ 
{0, . . . , B} : exactly b white balls are selected before ball n}. Then Wo = Ji, W\ = J% — J\, . . ., Wk-i = Jk — Jk-li an d 
W B ^ k =B + 1-Ji. Clearly, W b = B + 1. By symmetry, E[W b ] = (B + l)/(B - k + 1) (0 < b < B - k). 

Now let v — k be given, v — k £ {0, 1, ... , v}. Let 1^ be the indicator equal to 1 if exactly v — k white balls are selected 
among the first ft, balls. Note that -E/[lfc] = iph(k,v). Also, from the definition of W v -k, we have W v -k = J2h=o^- h - 
Therefore, E[W v - k ] = £f =0 E[l h ] = ELo Mk, v) = (B + 1)/(B -k + 1). 

E Proof of Theorem 14.11 

Replacing (fT3f into p2|) yields, 



l/B n , n>l,v = (i) 

E V k=oPn-i(k)(B + l)/(B(B-k + l)), n>l,0<v<B («) 

Pn(v)=( 1-Y££p»(!), n>l,v = B {Hi) (32) 

1, n = 0, v = (iv) 

0, n = 7 v^0 (v) 

In case (n), Pn (v) - p n - X {v){B + 1)/{B(B -v + l))+ Y,"k=lPn-i{k){B + 1)/{B(B - k + 1)) = p n -i(v)(B + 1)/(B(B - 
v + 1)) +p n (v — 1) which yields (flip)) . 



F Expression of p n (v) When 7 = 00 

If 7 = 00, the closed-form expression for p n (v) is p n (v)=(f) (^)™ELo CK -1 )^ 5 + l + i) - ") n > 1,S > u > 0, 
and p„(B) = 1 - ^JoPnM- 

G Self-Sustainability with Heterogeneous Download Times 

We now present an efficient algorithm to compute swarm self-sustainability if the block download times, [ih-, are not 
necessarily equal for all h, 1 < h < B. 

Denote by ah,r(v) the probability that v blocks are available among the peers conditioned on the presence of r users 
in layer h and all users being in layers /i up to B, 

a h<r (v) = P(V = v n h = r;rii = Q,i < h), 0<h<B, 0<r<M (33) 
M is the maximum number of users per layer. The following recursion correctly computes ah,ri 0<h<B,0<r< M, 



}ah,r-i(i)iph(i,v), r>l,h^B (i) 



ah,r{v)= < 



i=0 

E^Lo a ''+i,r(vW+iM, r = 0,h^B (ii) ^4) 

1, r = 0,h = B,v = (in) 

0, r = 0,h= B,v t^O (iv) 

We approximate A by its truncated version, A M \ considering only states in which there are up to M users in each layer 



of the system. Recall that N is the maximum number of users in the system, N = O(BM). A naive use of (|34f yields an 
algorithm to compute A (M) in time 0(B 3 M) = 0{B 2 N). 

In what follows, we show how to compute A^ M ^ in O(NBlogB). For this purpose, we re-write the sum in (|34|) (i) as 
a convolution, which is computed in time 0(B log B). Let 

a h , r -i(k) = a h , r -i(k)k\(B-k)\, 0<k<B and $ h (v - k) = l/((v-k)\(h-(v-k))\), 0<k<v 

Then, the following convolution correctly computes (|34[) fi). r — a^.r-i 55 Vv 
ah r is obtained from 

(a h , r (v))/((v-h)\(B-v)\( B h )), 0<v<B-l 
ahA v ) = \ ( 35 ) 



H Derivation of ( TTHT ) 



We now show that (IT8l) follows from (|T7|) . The case v = B follows trivially. For v < B, we show that (fT8|) satisfies (flTl) . 
i.e., p n (u) -p n _i(w)(l/(i? -« + !)) = p„(v - 1), 



p n (u)-p n _i(u)(l/(5-u + l)) 



^(^(-l^S-w + Z + l)""- 



£> \ x IV — 1 



5 - I! + 1 



(-l) I (B-» + f + 2)-"=p B (i;-l) 



I Derivation of ([19 



We now derive (19]) from p n (B) = J2f =0 (f )(-!)'(* + l) _n - 

= EEp»( B ) e " (B+1)p (( s + 1 )'°) ,l A 



(36) 



n=0 i=0 

5 >B 



2=0 



E N 



i e ~Sp e (s+i) P /(;+i) 



^ e -(s+i)p/(i+D(( B + + i))«/ n ! 



n=0 



J Probability that / tagged blocks are unavailable 

The probability that I tagged blocks are unavailable, when 7 = /i, is exp ^— p ^^ | ^ 1 M ^. 

Let tth, r be the probability that there are I tagged unavailable blocks, conditioned on all users being in stages h,h + 
1, . . . , B — 1,B, stage h having r users (similar in spirit to recursion presented in Appendix lG|). ah is the corresponding 
metric, but with no conditioning on the number of users in layer h. Given I, the following recursion yields a.h,r- 



ah,r= < 



(i) 1 



h=B, r=0 
h < B,r > 1 



i=0 B-i 

Y^m=o a h+i, m Kh+i{m) h<B,r = 



(37) 



Our goal is to find the expression of athfi, 



a/i,0 = E a h+l,mTTh+l{m) 



From |[37|1(m). 



n 



B-l-i 
B-i 



Otfi+l.m — Q!ft+1,0 



n 



B-l-i 
B-i 



(38) 



(39) 



Hence, replacing (|39|) into ([38]). 

/t-t B — l — i\ m e~ p p m r -A- £? — / — i i m e~ p p rn 

«M = E a w-i.o(n-B3rJ = a ' i+i -°z: [Il-B^-J (40) 



m— i— m— 2—0 



0M-i 



Let 



Therefore, 



D , ■ (B-h\ 
i=0 V// 

«M = Qih+1,0 > (/3^+ip) m — - = a h+1 . Q e x ) (/3 h+1 p) m — = a h+lfl e x (42) 

* — ' ml L — ' ml 

m—0 m—Q 

where —p + x = —0 h+1 p, 

x = p - Ph+iP = p(l - Ph+i) (43) 

So, 

(B-h-l\ 

a h fl = cth+1,0 exp(-p(l - Ph+i)) = oth+ifi exp(-p(l -g- — )) (44) 

v i ) 

The solution of recursion (l44l is 



(i) 1, h=B 
®h,o={ (B -a-i\ (45) 

(«) nf^expC-pCl-^i)), 0<fc<iJ 



If ft = 0, 



-B-l AB-a-l\ /B-l \ /B-l /B-a-l\ \ 

« ,o = n ex p(--°(i - Sir 1 )) - ex p(-p( E 1 - E )) ( 46 ) 

a=0 uJ \a=0 / \a=0 V/J / 

The hockey stick pattern of the Pascal triangle, Yla=o ( S_ f _1 ) = G+i)' together with (|46p . yields 

a 0fi = ex P (-„ [ B - { -§-))= «p (-p (^rrr^)) (47) 

K Integrating the User Dynamics into the Lower Layer 

Next, we provide an alternative description of our model which explicitly characterizes the evolution of the signature of 
each user in time. 

Assume that each user selects uniformly at random one of the blocks among those that he does not have. Each user then 
contacts other users uniformly at random for opportunities to download the selected block. The time between contacts 
initiated by a specific (tagged) user is characterized by a Poisson process of rate p. If the contacted user has the requested 
block, it is transfered immediately. Otherwise, the tagged user is instantaneously re-directed to the publisher, who then 
transfers the block. All transfers are assumed to be instantaneous. 

The system described above can be fully characterized using the signatures introduced in our lower layer model. Let 



S be the lower layer state, i.e., S is a {0, l} Bn bit vector in case there are n users in the system, (see §3.2[) . rii(S) is the 
number of users in stage i when the system state is S. Let h(S, u) be the stage of user u when the system state is S. 

Let be the union of the state spaces of the lower layer model corresponding to an the upper layer in which there 
are n users in the system, fl n = Un-|n|=n ^|n|- Let be the union of all possible state spaces of the lower layer model, 
n = U°^o Oj. Starting from state 5", let To be the state resulting from an arrival and T Ui {, be the state resulting from 
user u concluding the download of block b. 

Let denote a vector of lenght B with all elements equal to zero; ej denotes a vector with its i th element equal to 1 and 
all other elements equal to zero; S\u denotes the bit vector S after the removal of the bits corresponding to user i (bits 
[u — \)B + 1 up to uB). We assume that after a user leaves the system, the remaining users are re-indexed accordingly. 



Eh(S,u) <B-l 



If h(S,u) = B-1 



Tq : O n — > O n _|_i 

S -> 5.0 (48) 

(49) 



S -> S + e B{u _ 1)+b (50) 



S -> S\u (51) 

Note that transformation T Ut b explicitly characterizes the evolution of the signature of each user in time. (|50p corre- 
sponds to the download of a block and a signature update whereas (|51j) corresponds to a download completion. 
The entries of the generator matrix Q = (q(S, S') : S,S' £ SY) are 

q(S,T (S)) = A (52) 

q(S,T u>b (S)) = fi/(B-h{S,u)) (53) 



Equations (|52I) - (I53I) provide an integrated description of the upper and lower layer models. This model description is 
similar in spirit to [lOj . While Hajek and Zhu |10) are interested in studying the stability of a system that resembles the 
one described above, the system considered in this paper is always stable, and we are interested in its self-sustainability. 



